Advances in Acoustic Modeling for the Recognition of Czech

Authors

Jirí Kopecký

Ondrej Glembek

Martin Karafiát

Abstract

This paper presents recent advances in automatic speech recognition for the Czech language. Improvements were achieved in both acoustic and language modeling; we focus mainly on the acoustic part. The results are presented in two contexts: lecture recognition and the SpeeCon+Temic test set. The paper shows the impact of advanced modeling techniques such as HLDA, VTLN and CMLLR. On the lecture test set, we show that training acoustic models using word networks together with the pronunciation dictionary gives about a 4-5% absolute performance improvement over using direct phonetic transcriptions. Incorporating the "schwa" phoneme in the training phase yields a slight improvement.

Similar resources

Allophone-based acoustic modeling for Persian phoneme recognition

Phoneme recognition is one of the fundamental phases of automatic speech recognition. Coarticulation, which refers to the integration of sounds, is one of the important obstacles in phoneme recognition. In other words, each phone is influenced and changed by the characteristics of its neighboring phones, and coarticulation is responsible for most of these changes. The idea of modeling the effects o...

Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods

Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterances into transcriptions. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...

Acoustic and Language Modeling for Czech ASR in MALACH

Convolutional neural network with adaptive windows for speech recognition

Although speech recognition systems are widely used and their accuracy is continuously improving, there is a considerable performance gap between their accuracy and human recognition ability. This is partially due to high speaker variation in the speech signal. Deep neural networks are among the best tools for acoustic modeling. Recently, using hybrid deep neural network and hidden Markov mo...

F-TRANSFORM FOR NUMERICAL SOLUTION OF TWO-POINT BOUNDARY VALUE PROBLEM

We propose a fuzzy-based approach aimed at finding numerical solutions to some classical problems. We use the technique of F-transform to solve a second-order ordinary differential equation with boundary conditions. We reduce the problem to a system of linear equations and run experiments that demonstrate the applicability of the proposed method. We estimate the order of accuracy of the proposed ...

Publication date: 2008
